A Study on Distributed Frequent Co-occurrence Patterns Algorithms across Multiple Data Streams

نویسنده

  • Jing Guo
چکیده

With the era of big data coming, the data streams are fast, continuous, and unbounded. The real-time requirements of the data streams processing results are very high. A large number of researches have been on Frequent Co-occurrence Patterns across multiple data streams. But those algorithms are centralized, which is worked on a single compute node. The memory of a single compute node and CPU calculation can be limited, which is difficult to deal with the increasing data streams. Using the distributed server cluster is an effective way. However, the centralized algorithm cannot be directly deployed to distributed server cluster. This paper designs a Distributed Frequent Co-occurrence Pattern across multiple data streams to solve these problems. Through a lot of experiments to evaluate it, the algorithm can detect all the objects that meet the conditions in real time, and have good scalability. In order to save memory, this paper also improves the algorithm, and proposes Modified Distributed Frequent Co-occurrence Pattern based on P-condition deletion strategy. The improved algorithm can delete element combinations which can not constitute Frequent Co-occurrence Patterns in the initial stage, so as to effectively save memory.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Frequent Co-occurrence Patterns across Multiple Data Streams

This paper studies the problem of mining frequent co-occurrence patterns across multiple data streams, which has not been addressed by existing works. Co-occurrence pattern in this context refers to the case that the same group of objects appear consecutively in multiple streams over a short time span, signaling tight correlations between these objects. The need for mining such patterns in real...

متن کامل

Mining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows

Todays, in many modern applications, we search for frequent and repeating patterns in the analyzed data sets. In this search, we look for patterns that frequently appear in data set and mark them as frequent patterns to enable users to make decisions based on these discoveries. Most algorithms presented in the context of data stream mining and frequent pattern detection, work either on uncertai...

متن کامل

Displaying Co-occurrences of Patterns in Streams for Website Usage Analysis

One way of getting a better view of data is by using frequent patterns. In this paper frequent patterns are (sub)sets that occur a minimal number of times in a stream of itemsets. However, the discovery of frequent patterns in streams has always been problematic. Because streams are potentially endless it is harder to say if a pattern is frequent or not. Furthermore, the number of patterns can ...

متن کامل

Leadership Ranking: Bringing Order to Correlated Time Series

Data streams have emerged as an important research area in recent years due to its ubiquitous presence in various application domains including stock market analysis in finance. Therefore, it has become increasingly demanding to develop efficient techniques to discover useful patterns from such data streams. Among many issues, analyzing the correlation between two data streams is an important t...

متن کامل

An Algorithm Based on Horizontal Bit Vectors for Mining Frequent Patterns in Data Streams

Most algorithms for mining frequent patterns in data streams are based on structures like FP-tree, complex mining method makes time and storage space large compared to the bit vector expression. In this paper, an algorithm based on Horizontal Bit vectors for mining Frequent Patterns in data Streams HB-FPS is proposed. HB-FPS is divided into two phases, in online phase, it uses bit vectors to ho...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JSW

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2016